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Abstract 

Recently, Mike and Farmer have constructed a very powerful and realistic behavioral model 
to mimick the dynamic process of stock price formation based on the empirical regularities 
of order placement and cancelation in a purely order-driven market, which can success- 
fully reproduce the whole distribution of returns, not only the well-known power-law tails, 
together with several other important stylized facts. There are three key ingredients in the 
Mike-Farmer (MF) model: the long memory of order signs characterized by the Hurst index 
Hg, the distribution of relative order prices x in reference to the same best price described 
by a Student distribution (or Tsallis' g-Gaussian), and the dynamics of order cancelation. 
They showed that different values of the Hurst index Hs and the freedom degree of 
the Student distribution can always produce power-law tails in the return distribution /(r) 
with different tail exponent a^.. In this paper, we study the origin of the power-law tails of 
the return distribution /(r) in the MF model, based on extensive simulations with different 
combinations of the left part fhi^) for x < and the right part fR{x) for x > of f{x). 
We find that power-law tails appear only when fhi^) has a power-law tail, no matter fR{x) 
has a power-law tail or not. In addition, we find that the distributions of returns in the MF 
model at different timescales can be well modeled by the Student distributions, whose tail 
exponents are close to the well-known cubic law and increase with the timescale. 
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1 Introduction 



Many stylized facts have been unveiled in different stock markets 111, |2|, |3|] . Un- 
derstanding the underlying regularities causing stylized facts are crucial in stock 
market modeling. Three different families of market models exist aiming at repro- 
ducing the main stylized facts. The first family is the dynamic models, such as the 
multifractal model of asset returns [4], which was later extended in several publica- 
tions iSfilvllsl], and the multifractal random walks iSEEil]. The second family is 
the agent-based models (or multi-agent models), in which agents buy or sell shares 
according to some rules and the price variations are determined by the imbalance 
of demand and supply. There are different types of agent-based models, such as 
percolation model^Il H H H 0, [lilsing models H H S [zE 0, 
minority games iSTQ, Q, S S 12^1301 M\, and others H. The minority 
games are among the most important agent-based models and thus many variants 
have been proposed. The third family is the order-driven models, where researchers 
attempt to simulate the dynamics of order books. The price in order-driven models 



changes based on the continuous double auction (CDA) mechanism [1331 l34l 13511. 
A nice review of order-driven models was recently given by Slanina [|36n . We can 
think of the agent-based models and the order-driven models as microscopic mod- 
els for quote-driven markets and order-driven markets, respectively. 

Recently, Mike and Farmer have constructed a very powerful and realistic behav- 
ioral model to mimick the dynamic process of stock price formation OTII . which 
belongs to the third family. We call it Mike-Farmer model, or MF model for short. 
It seems undoubtable to us that the MF model is a milestone in the modeling of 
order-driven markets, which will prove to introduce an important improvement in 
asset derivative pricing and risk management. Having said this, we stress that the 
MF model is still very simple as mentioned already by Mike and Farmer BTII and 
there are still a lot of open problems to be addressed. Indeed, the MF model pro- 
vides a nice platform to unravel the origin of stylized facts of stocks in order-driven 
markets. The essential advantage of the MF model is that it is constructed based 
on the empirical regularities of order placement and cancelation in a purely order- 
driven market, which can successfully reproduce the whole distribution of returns, 
not only the well-known power-law tails, together with several other important styl- 
ized facts. There are three key ingredients in the MF model: the long memory of 
order signs characterized by the Hurst index Hg, the distribution of relative order 
prices x in reference to the same best price described by a Student distribution, and 
the dynamics of order cancelation. 

Through extensive simulations, Mike and Farmer found that different values of the 
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Hurst index Hg and the freedom degree ax of the Student distribution can always 
produce power-law tails in the return distribution /(r) with different tail exponent 
ttr. Specifically, they found that increases almost linearly with for fixed Hg 
and decreases approximately linearly with Hg for fixed a^. Our simulations of the 
MF model with different values of ax (ranging from 0.9 to 1.9 with a step of 0.1) 
and Hs (ranging from 0.1 to 0.9 with a step of 0.1) confirm this finding. Speaking 
differently we have simulated 9x11 versions of the MF model. For each version, 
four million simulation steps are conducted after removing the initial transient data 
and we obtain about one million data points of trade-by-trade returns. We find that 
the dependence of upon Hg and ax can be modeled using the following formula: 

ar = 0.61 + 2.050^; - O.llif, - 0.34i/,a^ . (1) 

These observations can be explained as follows. With the increase of Hg, the mem- 
ory of order signs becomes stronger and more orders of the same direction (buy or 
sell) are placed successively. This results in more large price fluctuations and the 
decay of /(r) becomes slower. Hence the tail exponent decreases. On the other 
hand, if a^. is small, more passive orders with x < are placed deep inside the 
order book and the standing volumes close to the best ask or bid price are relatively 
small. Speaking differently, the depth of the order book is low and the liquidity is 
low. Also, there are more aggressive orders with x > placed resulting in more 
market orders. Both effects lead to more large price fluctuations and slower decay 
of the return distribution. 

Although both the strength of the long memory of order signs and the tail exponent 
of relative order prices have significant influence on the distribution of returns, it 
is unclear what causes the power-law tails in the MF model. In this paper, we will 
address this question based on extensive simulations with different combinations 
of the left part /l(x) for x < and the right part //? (x) for x > of /(x). We find 
that power-law tails appear only when /i,(x) has a power-law tail, no matter /i?(x) 
has a power-law tail or not. Moreover, we find that the return distributions in the 
MF model at different timescales can be well modeled by the Student (or Tsallis' 
g-Gaussian) distributions, whose tail exponents are close to the well-known cubic 
law and increase with the timescale. 



2 Description of the Mike-Farmer model 

In purely order-driven markets, the main trading mechanism is the continuous dou- 
ble auction. Passive traders are patient and place effective limit orders that are 
stored in the order book waiting for execution, while aggressive traders are inpa- 
tient and submit effective market orders that are executed immediately. Consider a 
limit order placed at event time t whose logarithmic price is 7r(t). Denote 7r„(t — 1) 
and ni,{t — 1) the logarithms of best ask and bid prices right before t. A buy limit 



3 



order with 7r(t) ^ Hait — l) or a sell limit order with n(t) ^ nb(t — l) is classified as 
an effective market order. Speaking differently, orders with the relative prices x{t) 
less than the preceding spreads S{t — 1) = TVa^t— 1) — 7r{,(t — 1) are effective limit 
orders, while orders with x{t) ^ S{t — 1) are effective market order^. Orders 
waiting on the limit order book are either satisfied by future effective market or- 
ders or canceled. Therefore, the continuous double auction can be simulated if one 
knows the regularities governing the dynamic processes of order placement and 
cancelation. Most order-driven models also follow this line. However, to the best 
of our knowledge, the MF model is the only one that uses empirical regularities of 
order placement and cancelation extracted from real stock data. This is the reason 
why the MF model can reproduce the cubic law of return distribution without tun- 
ing any model parameters. Actually, the MF model does not introduce any artificial 
tunable model parameters at all. All parameters in the MF model are determined 
empirically and have clear financial meanings. The regularities of order placement 
and cancelation may be different for different stock markets. The MF model can be 
easily modified for other markets. 



When placing an order, the trader needs to determine its sign for buys and 

"— 1" for sells), size and price (or the relative price x). In the MF model, all or- 
ders are assumed to have identical size. The signs of successive orders have strong 
memory, which can be characterized by a rather large Hurst index Hg close to 0.8 
or even larger [1371. 13811. This finding is conclusive without any controversy. In con- 
trast, the distribution of relative prices seems different in different markets. Zovko 
and Farmer studied the unconditional distribution of relative limit prices defined 
as the distance from the same best prices for orders placed inside the limit-order 
book ll39n . They merged the data from 50 stocks traded on the London Stock Ex- 
change (August 1, 1998 to April 31, 2000) and found that the distribution decays 
roughly as a power law with the tail exponent a = 1.5 for both buy and sell orders. 
Bouchaud et al. analyzed the order books of three liquid stocks on the Paris Bourse 
(February 2001) and found that the relative price of new orders placed inside the 
book follows a power-law distribution with the tail exponent a = 0.6 ll40ll . Potters 
and Bouchaud investigated the relative limit price distributions for inside-the-book 
orders of three Nasdaq stocks (June 1 to July 15, 2002) and found that the distri- 
butions exhibit power-law tails with an exponent a = 1 114 ill . Maskawa analyzed 
13 rebuild order books of Stock Exchange Electronic Trading Service from July to 
December in 2004 on the London Stock Exchange and found that the limit prices 
for all orders inside the book are broadly distributed with a power-law tail whose 
exponent is a = L5 [42], which is consistent with the results of Zovko and Farmer 
II39I1 . He also presented the distribution in the negative part for more aggressive 
order outside the book and found that the negative part decays much faster than 
the positive part. Mike and Farmer focused on the stock named AZN and tested on 
24 other stocks listed on the London Stock Exchange (LSE) [370. They found that 



^ The relative price x(t) is defined as 7r(t) — 7:i,{t — 1) for buy orders and 7ra{t — 1) — 7r(t) 
for sell orders. 
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the distribution of relative logarithmic prices can be fitted by a Student distribution 
with a = 1.0 — 1.65 degrees of freedom and the distribution is independent of bid- 
ask spread at least over a restricted range for both buy and sell orders. Gu, Chen and 
Zhou analyzed 23 Chinese stocks traded on the Shenzhen Stock Exchange (SZSE) 
and found that the distribution of relative prices is asymmetric n43\\ . They showed 
that the distribution has power laws with the exponents greater than 1 and lower 
than 2. 



There are also efforts to seek for factors influencing order placement. Using 15 
stocks on the Swiss Stock Exchange, Ranaldo found that both bid-ask spread and 
volatility negatively relate to order aggressiveness [|44ll . Lillo analyzed the origin 
of power-law distribution of limit order prices considering the order placement 
as an utility maximization problem considering three factors: time horizon, util- 
ity function and volatility ll45ll . He found that the heterogeneity in time horizon is 
the proximate cause of the asymptotic power-law distribution, while heterogeneity 
in volatility is hardly connected with the origin of power-law distribution. Mike and 
Farmer found that the distribution f{x) of LSE stocks is independent of the bid-ask 
spread [|37ll . which was confirmed by Gu, Chen and Zhou using SZSE stocks [431]. 



In a zero intelligence model 11341. l35h. order cancelation is assumed to be a Poisson 
process. Alternatively, Mike and Farmer found that the conditional probability of 
canceling an order i at time t is influenced by at least three factors OTII : the ratio yi 
of current relative price of an order to its original relative price when it is placed, 
the number ntot of orders in the order book, and the order book imbalance n^nh that 
is defined as the ratio of the number of buy (or sell) orders to Utot- By assuming 
that Hi, ntot and rzimb are independent, the conditional probability of cancelation per 
order has the following form: 



P{Ci\yi,ntot,nimh) = A{1 - e ^Ol'^imb + B)/nt, 



■ Ot ) 



(2) 



where the parameters A and B are determined empirically using real data of indi- 
vidual stocks [371. 



Now we can describe the MF model as follows. We stress that the sizes of orders are 
set to unity. In each round of the simulation, we simulate 2x10^ steps and the first 
2000 data points are discarded from the ensuing analysis. We repeat this process 20 
times, which results in four million orders and about one million transactions. In 
each round, we generate two arrays of the relative prices x{t) and the order signs 
s{t). The sign array of the orders is generated from a fractional Brownian motion 
with Hurst index Hg and the relative price x is taken from a Student distribution 
with scale and degrees of freedom. At each simulation step or event time 
t, an order is generated, which is characterized by x{t) and s(t). When x{t) ^ 
S{t — 1), the order is executed and a buy limit order (if s(t) < 0) or a sell limit 
order (if s{t) > 0) at the best bid or ask price is removed from the order book. 
When x{t) < S{t — 1), the order is stored in the order book at the price level 
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Tint[7r(t)/T], where T is the tick size, int[2;] is the largest integer smaller than z, 
and 7i(t) = x{t) + 7ii,{t — l) for buy orders or 7i(t) = 7ra{t — l) — x{t) for sell orders. 
We then calculate the values of -P(Cj|yj, ritot, '^imb) for all orders waiting in the 
order book. A random number is drawn from a uniform distribution defined on 
the interval [0, 1]. All orders with P{Ci\yi, ntot, ''^imb) ^ pit) are canceled from the 
order book. The returns between successive trades are used in this work. We note 
that the results presented in n31\\ are successfully reproduced in our simulations. 



3 The origin of power-law tail of returns 



3. 1 Methodology 



Mike and Farmer have shown that the power-law tails in the return distribution 
/(r) become heavier if the long memory in the order signs is stronger [37]. We 
notice that the power-law tails do not vanish even when Hg = 0.5. In other words, 
long memory in the order signs cannot explain the emergence of power-law tails of 
returns. Therefore, we turn to investigate the influence of /(x) on /(r). According 
to the setting of the MF model, the shape of /(x) for x > S does not impact the 
shape of /(r), since those orders are effective market orders that always remove 
one unit of shares from the opposite side of the order book, despite of the true 
prices of the incoming effective market orders. We thus speculate that the right part 
of f{x), denoted as fR{x), has much weaker influence than the left part fiix)- In 
our simulations, the tick size is T = 3 x 10"'' and three model parameters are fixed 



according to UM- = 0.8, A = 1.12, and B = 0.2. 



In order to unveil the effect of the two parts of f{x) on the tail behavior of /(r), we 
adopt different formulae for f{x). In general, we can write the following 

fix) = ^' ^ . (3) 

[fR{x), x^O 

Obviously, when x = 0, we require that 

/l(0) = fniO) . (4) 

In the MF model, f{x) = /l(x) = /r^x). In this section, we will use different 
functional forms for fL{x) and fnix). 



The first class is the Student density Hw or Tsallis' g-Gaussian ll47h . whose density 
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IS 

2 , , ax+1 



fqci^Wx, L) = X (ax + Lx"^) ' , (5) 



,2' 2 

where a.^ is the degrees of freedom parameter (or tail exponent), L is the scale 
parameter, and B{a, h) is the Beta function, that is, B{a, h) = T(a) T{b) / T(a + h) 
with r(-) being the gamma function. The second class is the Laplace distribution 
or double exponential distribution, whose density is 

fDE{x\X) = ^e-^l^l , (6) 
and the third class is the normal distribution, whose density is 

fG{x\a) = -^e-^'/'^' , (7) 



According to the symmetry of the distribution of x in the MF model 113711 . the mean 
of X is fixed to null in all these distributions. 

In the model specification, we choose fL{x) and from fgc, foE and fc, 

respectively. The constraint (H]) can be specified for different combination of 
and fR{x) as follows. When the two parts of f{x) are fgc and foE, we have 



2^LK = Afi(l/2,a,/2) . (8) 
When the two parts of f{x) axe f^c and fa, we have 



B{l/2,aj2) = a^27iL/ax. (9) 
When the two parts of f{x) axe foE and fc, we have 

AV = 2/7r, (10) 
which is equivalent to the combination of Eq. ([8]) and Eq. 

We fix ax = 0.0024 in all the simulations so that the simulated sample of x is 
comparable to real data ll37ll . The values of L in Eq. © are calculated as follows: 

L = ax/[{l + ax)al] . (11) 

The values of A and a axe determined respectively according to Eq. ([8]) and Eq. Q 
for different values of a^. We investigate different combinations of fhix) and //^(x) 
according to if they have a power-law tail. The case that both parts have a power-law 
tail has been studied by Mike and Farmer ['jt'], as discussed in Section[T] Therefore, 
we are left with three cases: (1) There are no power-law tails in f{x) \ (2) The right 
part of f{x) has a power-law tail with = fqG{x\ax, L); and (3) The left part 

of f{x) has a power-law tail with /l(x) = fqG{x\axi L). We shall investigate these 
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three cases in the rest of this section. 



3.2 Case 1: There are no power-law tails in f{x) 

In this case, there are no power-law tails in f{x) and we have four combina- 
tions for and that are {fDE{x\X),fGix\a)}, {/d£;(x|A), /Di?(x|A)}, 
{fcixla), fDEix\\)}, and {fG{x\a), /g(x|(t)}. For each combination, we investi- 
gate five different values of ax from 1.1 to 1 .9 with a step of 0.2. For each value of 
a^, our simulations are conducted for 20 repeated rounds. In each round, 2x10^ 
incoming orders are generated, driving the trading system evolve, and the first 2000 
orders are excluded from analysis. The complementary cumulative distribution can 
be determined for each in every combination. We show the tails of F(|r|), which 
is the complementary cumulative probability distribution of |r | , since the return dis- 
tributions are almost symmetric for positive and negative returns. 

The resultant 20 empirical distributions of the trade-by-trade absolute returns |r| 
for each combination are given in Fig.flja). It is evident that there is no power-law 
tails observed in the distributions. For each combination, the distribution -F(|r|) de- 
cays faster for larger a^. We also find that there are more large returns for the two 
combinations where fi = {fDE{x\)^)- This is explained by the two facts that fiix) 
has stronger impact on -F(|r|) or equivalently /(r) and that a Laplace distribution 
has heavier tails than a Gaussian. Fig. [TJb) presents the distributions of the stan- 
dardized returns g = [r — iJi.,)/ar, where yUr-(~ 0) and are respectively the mean 
and the standard deviation of r. It is interesting to observe that the five distributions 
collapse onto a single curve for all four combinations. 



Fig. 1. (Color online) Resultant distribution of the trade-by-trade absolute returns |r| (a) 
and the corresponding standardized returns \g\ (b) when the relative price distribution /(x) 
has no power-law tails. Each cluster of curves con^esponds to a combination of and 
The clusters have been shifted vertically for clarification. The associated combina- 
tion {/l(x), from top to bottom is {fDE{x\X), /g(x|o-)}, {/de(x|A), fDE{x\\)}, 
{/g(x|(t), //)£;(x| A)}, and {/G(x|cr), /(^(xlcr)}, where A and cr are determined respec- 
tively according to Eq. ([D and Eq. ^ for different values of ax- 
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3.3 Case 2: The right part of f{x) has a power-law tail with Jr^x) = fgci^lax, L) 

In this case, the right part of f{x) has a power-law tail with = fqcixla^, L) 

and we have two combinations for and/ii;(x): {fDE{A^)^fqG{A(^x,L)} and 
{fG{x\a), fqG{x\ax, L)}. The simulation procedure is the same as in Section [3^ 
The resultant 10 empirical distributions of the trade-by-trade absolute returns |r| 
for each combination are given in Fig.|2a). Again, no power-law tails are observed 
in these distributions. For each combination, the distribution decays faster 

for larger a^.. We also find that there are more large returns for the the combina- 
tion where fi = {fDE{x\X)}. The same explanation applies. Fig. [2l;b) presents 
the distributions of the normalized returns \g\ and the five distributions for each 
combination collapse remarkably onto a single curve. 




Irl \g\ 



Fig. 2. (Color online) Resultant distribution of the trade-by-trade absolute returns |r| (a) 
and the corresponding normalized returns \g\ (b) when the right pait of f{x) has a pow- 
er-law tail, that is, fR{x) = fqG{x\ax,L). The upper cluster of curves corresponds to 
= /de(x|A) and the lower cluster coiTcsponds to fhix) = fG{x\(y), where A and a 
are determined respectively according to Eq. ^ and Eq. ^ for different values of ax- The 
lower cluster has been shifted downwards for clarification. 

3.4 Case 3: The left part of f{x) has a power-law tail with = fqGix\ax, L) 

In this case, the left part of f{x) has a power-law tail with fiix) = fqG{x\ax, L) 
and we have two combinations for /^(x) and/R(a;), which are {fqG{x\ax, L), foE{x\^)} 
and {fqG{x\ax^ L), fG{x\a)}. The simulation procedure is the same as in Section 
13.21 The resultant empirical distributions of the trade-by-trade absolute returns |r| 
for each combination are depicted in Fig.[3];a) and (b), respectively. Nice power-law 
tails are observed in all the distributions. For each combination, the tail exponent 
a J. increases with a^. Comparing the distributions in the two plots, no significant 
difference can be identified in the return distributions with the same value of Ux. 
In other words, the shape of or equivalently /(r) is fully determined by 

fnix) = fqciA^xi L). It is not out of expectation that there is no scaling in the 
distributions of the normalized returns, F{\g\), and we thus do not show them here. 
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Fig. 3. (Color online) Resultant distribution of the trade-by-trade absolute returns |r| when 
the left part of f{x) has a power-law tail, that is, = fqci^lo^x, L). The right part 

of f{x) is = fDE{x\X) in (a) and /r(x) = /g(x|(7) in (b), where A and a are 

determined respectively according to Eq. dUl and Eq. ^ for different values of Ox- 

In summary, our simulations confirms that the return distribution is mainly deter- 
mined by the distribution of the relative prices of incoming orders placed within 
the order book with x ^ 0. Heavier tail in will result in heavier tails in /(r). 
Only when has a power-law tail, /(r) will have power-law tails. 



4 The return distributions at different timescales 



We now investigate the return distributions at different timescales for the MF model 



with the parameters being extracted from real data [|37|]. Specifically, the values of 
model parameters used in this section are the following: ax = 1.3, ax = 0.0024, 
Hs = 0.8, A = 1.12, and B = 0.2. We stress that x follows the Student distribution 
in the standard MF model. The simulation procedure is the same as described in 
the previous section. We consider the normalized returns rather than the returns per 
se for convenience. 

We adopt the mid-price of the best bid 7ib{t) and best ask Tiait) as the logarithmic 
price at time t after a transaction occurs: 

/(*) = ^-M±^ , ,12) 

where t is the event time corresponding to single trades. The event-time return after 
At trades is then defined as the logarithmic price change: 

rAtit)=Iit)-Iit-At). (13) 

Here we deal with the standardized returns 

9Atit) = [rAtit) - ^ At]/ (7 At , (14) 
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where fiAt and a At are respectively the mean and the standard deviation of returns 
r^t- For simplicity, we drop the subscript r^t below. 



4. 1 Probability distributions of trade-by-trade returns 



We first focus on At = 1. The empirical probability density function f{g) is esti- 
mated, as shown in Fig. Ufa). We find that f{g) can be well modeled by a Student 
density. Nonlinear least-squares regression gives = 2.9 and L = 3.3. The fitted 
curve is drawn on the left panel. According to Fig. Ufa), the Student density fits 
nicely the tails of the empirical density f{g). The fitted model deviates from the 
empirical density remarkably for small values of l^fj. If we amplify the central part 
for small \g\ with finer binning, the shape of f(g) looks like a Mexican hat. This is 
the very character of return distributions of individual stocks caused by the discrete- 
ness of price changes in units of tick size. This intriguing structure was reported for 
common stocks in the US market ll48n and in the Chinese market [|430 . 




^10 
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Fig. 4. Empirical probability density function of the normalized trade-by-trade returns. 
Panel (a): Empirical probability density function f{g) of the normalized returns g. The 
solid line is the Student density with = 2.9 and L = 3.3. Panel (b): Empirical cumula- 
tive distributions for positive and negative normalized returns g. The solid lines are 
the least squares fits of power laws to the data with = 2.87 it 0.02 for the positive tail 
and a~ = 3.06 it 0.03 for the negative tail. 



fig) 



(15) 



For large values of \g\, the Student density function f(g) approaches power-law 
decay in the tails: 

for ^ < 

The empirical cumulative distributions F{\g\) of positive g and negative g are illus- 
trated in Fig.|5b). Both positive and negative tails decay in a power-law form with 
+ — 2.87 ± 0.02 and = 3.06 ± 0.03, which are in line with the tail exponent 



a, 
a. 



estimated from the Student model. These results indicate that the standardized 



returns obey the (inverse) cubic law. 
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4. 2 Probability distributions of trade-aggregated returns 



We now turn to investigate the distributions of the standardized trade-aggregated 
returns (7 At, where At spans several trades. By varying the value of At, we are able 
to compare the PDF's at different timescales. Specifically, we compare the PDFs 
for At = 2, 4, 8 and 16 trades with that for At = 1 trade. As listed in the second 
column of Table [H the kurtosis of each PDF is significantly greater than that of the 
Gaussian distribution whose kurtosis is 3, indicating a much slower decay in the 
tails. In addition, the kurtosis decreases with respect to the scale At. Very similar 
leptokurtic behavior exists in real stock markets | jj]. 

Table 1 

Characteristic pai^ameters for trade-aggregated returns. 



At 


Kurtosis 


g-Gaussian 


Positive tail 


Negative tail 


L 


ar 


Scaling range 




Scaling range 




1 


19.68 


3.3 


2.9 


1.5 ^ ^ ^ 50.1 


2.87 ±0.02 


1.5 ^ -5 ^ 39.8 


3.06 ±0.03 


2 


19.52 


3.1 


3.0 


1.5 ^ 5 ^ 36.3 


2.90 ±0.02 


1.5 ^ -5 ^ 30.2 


3.15 ±0.04 


4 


17.06 


3.2 


3.1 


1.5 ^ 5 ^ 27.7 


2.94 ±0.03 


1-7 ^ ^ 22.9 


3.24 ±0.04 


8 


13.91 


2.9 


3.2 


1.7 ^ 5 ^ 15.8 


3.07 ±0.04 


1.7 ^ -5 ^ 15.9 


3.42 ± 0.05 


16 


10.72 


2.5 


3.5 


1.7 ^ ^ ^ 12.1 


3.39 ±0.05 


1.9 ^ ^ 7.6 


3.77 ±0.07 




Fig. 5. (Color online) Empirical distributions of aggregated returns at different time scales 
At = 1, 2, 4, 8 and 16. Panel (a): Empirical densities f{g) of the aggregated returns. Panel 
(b): Empirical cumulative distributions for positive (upper cluster of curves) and 

negative (lower cluster of curves) returns g. 

The empirical f{g) functions for different time scales At are illustrated in Fig.[5ta). 
We represent the distribution of one-trade returns for comparison. It is evident that 
the tail is heavier with the decrease of At. This phenomenon can also be character- 
ized by the kurtosis of the distributions. We also notice that the PDF for At = 16 
decays slower than exponential. We have fitted the five curves using the Student 
density model dS]) and the estimated parameters L and ar are listed in Table [B In 
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Fig.[5tb), we study the tail distributions of the normalized returns g. It is observed 
that both positive and negative tails decay in power-law forms. We have estimated 
the tail exponents, which are also presented in Tabled Note that the scaling range 



decreases with increasing At, which is also observed for two Korean indexes ll49ll 



and 23 individual Chinese stocks 114311 . As expected, the tails that are characterized 
by their tail exponents decay faster for larger At, which is consistent with the 
behavior of kurtosis. 

The results obtained so far show that, the mock stock simulated from the MF model 



shares striking similarity in the price dynamics with the Chinese stocks [14311 . How- 
ever, there are also minor discrepancies. First, the tail exponents of the mock stock 
are less than that of the Chinese stocks with about 0.2. Second, the return distri- 
butions are slightly left-skewed while that of the Chinese stocks are slightly right- 
skewed. It is not clear if these discrepancies stem from the simplicity of the MF 
model or are just a reflection of the fact that the parameters of the MF model are 
not extracted from the Chinese stocks. The analysis presented here shows that the 
MF model is very powerful and universal. 



5 Conclusion 



In conclusion, we have studied the return distributions of mock stocks in the Mike- 
Farmer model based on extensive simulations. We found that the power-law tails 
of the return distribution /(r) in the MF model are caused by the power-law tail 
in the left part of the distribution of the relative prices of incoming orders, 

no matter the right part fni^x) has a power-law tail or not. In addition, we found 
that the distributions of returns in the MF model at different timescales in units 
of trades can be modeled by Student distributions, whose tail exponents are close 
to the well-known cubic law and increase with timescale. The behavior of return 
distributions is comparable to that of the real data. 
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